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METHOD OF INCREASING YIELD OF_MMLURE~BROJ.E J^ 

MAMMALIAN CELLS 

This invention relates generally to an improved 
method of producing mature proteins in mammalian cells, 
and more specifically, to a method of enhancing or 
increasing the extent of functional polypeptides, thereby 
increasing yields of mature biologically active proteins. 



Background of the Invention 

10 Many eukaryotic proteins are naturally 

synthesized as larger precursor polypeptides, requiring 
further specific proteolytic processing for full 
maturation prior to secretion. In many cases, this 
processing is also essential for full biological activity 

15 of the mature protein. Cleavage of these precursors 

frequently occurs at sites marked by paired basic amino 
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acid residues, e.g. Lys-Arg and Arg-Arg. [Dickerson et 
al, J . Biol . Chem. , 265 : 2462 (1990); Achsletter et al, 
EMBO J. , 4:173 (1985); Mizuno et al, Biochem. Biophys. 
Res . Commun . , 144 : 807 (1987)]. \^ 

Cleavage at the site of a paired basic aminb 
acid sequence removes many propeptides which function in/ 
a variety of roles in the processing of the mature 
protein. In certain cases the propeptide can mediate 
correct folding and disulfide bond formation within the 
protein sequence. In other cases the presence of the 
propeptide appears to be involved in y-carboxylation of 
glutamic acid residues in vitamin K-dependent coagulation 
factors. Y~ car k° x Y lated proteins include Factor IX and 
Protein C, and certain bone-specific proteins, such as 
bone Gla protein/osteocalcin. The propeptide can also 
direct intracellular targeting and regulate the 
coordinate synthesis of multiple mature peptides from a 
single precursor polypeptide. 

The sequences of the propeptide domains of 
certain vitamin K-dependent blood coagulation proteins 
have been published [See, Furie et al, Cell . 53 : 505 
(1988)] and the size of the propeptide has been 
established for both Factor IX and Protein C. 
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Factor IX is a zymogen of a serine protease 
that is an important component of the intrinsic pathway 
of the blood coagulation cascade. The protein is 
synthesized in the liver and undergoes extensive co- and 

5 post-translational modification prior to secretion. 

These modifications involve endoproteolytic processing to 
remove the pre- and pro-peptides, glycosylation, vitamin 
K-dependent y-carboxylation °f 12 amino-terminal glutamic 
acid residues and /3-hydroxylation of a single aspartic 

0 acid residue. 

The y-carboxyglutamic acid residues confer 
metal binding properties on the mature Factor IX protein 
and may function similarly in the processing of the other 
vitamin K-dependent blood clotting proteins. These y- 

5 carboxyglutamic acid residues are essential for coagulant 

activity. The gamma-carboxyglutamate (GLA) domain of 
Factor IX has also been identified as a major requirement 
for cell binding [Derian et al, J. Biol . Chem. , 
264 (12) : 6615-6618 (1989) ] . 

0 With the advance of genetic engineering, many 

eukaryotic proteins are being produced recombinantly in 
selected cell lines, particularly mammalian cell lines. 
For example, Chinese Hamster Ovary (CHO) DUKX cell lines 
producing recombinant Factor IX at high antigen levels 



(20 jug/ml/day) have been isolated. However, only 1-2% of 
that recombinant protein is Y" carbox yl a ted, and therefore 
biologically active, in the presence of vitamin K3 
[Kaufman et al, J. Biol. Chem. , 26JL (21) : 9622-28 (1986)]. 
Additionally, amino-terminal sequencing of the 
recombinant protein has found that 50% of the recombinant 
Factor IX produced by the CHO cells retain the propeptide 
[Derian et al, J. Biol . Chem. , 264(12): 6615-18 (1989)]. 
Presumably, the endoproteolytic processing enzyme of the 
CHO cells directing this cleavage was either saturated or 
simply inefficient in its function. 

Despite the fact that several processing 
enzymes have been proposed as being involved in the 
propeptide processing reactions, the enzyme or enzymes 
responsible for these endoproteolytic cleavages in 
mammalian cells have not been fully characterized. 

The purification of proprotein cleavage enzymes 
has been hampered by their low levels of activity in 
mammalian tissue and by their membrane-associated nature. 
Purification of these specific proteases has been 
complicated additionally by non-specific cleavage of the 
assay substrates in vitro , and by contaminating proteases 
such as those released from lysosomes. 



The yeast enzyme Kex2 is a membrane-bound f Ca ++ - 
dependent serine protease which functions late in the 
secretory pathway of Saccharomyces cerevisiae , cleaving 
the polypeptide chains of prepro-killer toxin and prepro- 
a-factor at the paired basic amino acid sequences of Lys- 
Arg and Arg-Arg. [Julius et al, Cell , 3_7:1075 (1984); 
Julius et al, Cell , 36:309 (1984)]. 

When expressed in mammalian cells, yeast Kex2 
endopeptidase reportedly cleaved a neuroendocrine 
prohormone [Thomas et al, Science , 241 : 226-230 (1988)]. 
Foster et al, Thrombosis and Haemostasis . 62^:321 (1989) 
have reported that the yeast gene product of Kex2 cleaves 
the Protein C precursor to a 2 -chain form when the yeast 
endoprotease of the Kex2 gene and the wild-type Protein C 
precursor are coexpressed. However, propeptide 
processing and the effect of Kex2 expression have not 
been studied. 

Recently, a human insulinoma cDNA encoding a 
mammalian subtilisin-like protease, designated PC2 , has 
been implicated in the endoproteolytic processing of 
prohormones based on its homology to the yeast Kex2 
protease [Smeekens et al, J. Biol . Chem. , 265 : 2997 
(1990)]. To date, however, no functional activity has 
been demonstrated for the PC2 clone. 
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The availability of the complete Kex2 gene 
sequence also allowed the detection of significant 
homology between the Kex2 protein and "fur in" , the 
product of the partially characterized human fur gene, a 
5 gene in the immediate upstream region of the c-fes/fps 

proto-oncogene [Roebroek et al, EMBO J. , 5:2197 (1986)]. 
The complete nucleotide sequence of the putative coding 
region of the fur gene has been reported. Upon 
comparison, the human fur gene product has demonstrated 

10 structural homology with the subtilisin-type serine 

protease encoded by the Kex2 gene of the yeast S . 
cerevisiae [van den Ouweland et al, Nucl . Acids Res. , 
18(3): 664 (1990)]. However, no evidence of the 
expression of fur was reported. 

15 There remains a need in the art for a method of 

increasing the efficiency of proteolytic processing of 
precursor polypeptides in mammalian cells. 




Summary of the Invention 

In one aspect, the present invention provides a 
2 0 method for increasing the efficiency of, or otherwise 

enhancing the production of, a functional, mature 
protein, which protein requires processing of a pro- 
peptide form for biological activity. The method may be 



used for the production of y-carboxylated proteins. The 
invention may also be used for the processing of other 
proteins, not requiring gamma carboxylation, leading to 
higher levels of biologically active or otherwise useful 
proteins . 

The method may be performed by transfection 
into a selected host cell line of one or more expression 
vectors containing a paired basic amino acid cleaving 
enzyme (PACE) DNA sequence (SEQ ID NO: 1) and a DNA 
sequence encoding the selected proprotein, each sequence 
operably linked to a heterologous expression control 
sequence, or by transfection of the PACE DNA (SEQ ID NO: 
1) into a host cell line known to express the desired 
protein or by transfection of a DNA for the desired 
protein into a cell known to express PACE (SEQ ID NO: 2) . 

Of use in the present invention is a 
recombinant DNA molecule comprising a DNA sequence 
encoding PACE (SEQ ID NO: 1) or a homolog thereof. The 
DNA molecule provides the PACE DNA (SEQ ID NO 1) in 
operative association with a regulatory sequence capable 
of directing the replication and expression of PACE (SEQ 
ID NO: 2) in a selected host cell. 

Another aspect of the invention includes a 
recombinant DNA molecule comprising a DNA sequence 
encoding PACE (SEQ ID NO: 1) and a DNA sequence encoding 
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a selected proprotein requiring complete processing for 
biological activity, both DNA sequences being in 
operative association with one or more heterologous 
regulatory sequences capable of directing the replication 
and expression of PACE (SEQ ID NO: 2) and the selected 
propeptide in a selected host cell. In one embodiment 
the selected proprotein requires Y~ carbox Ylation for 
activity. 

In a further aspect the present invention 
provides a host cell containing and capable of expressing 
DNA sequences encoding PACE (SEQ ID NO: 1) and a selected 
precursor polypeptide which is capable of producing high 
levels of active, mature protein. The cell line may be 
transfected with the recombinant DNA molecule (s) 
described above. This cell line may be cultured under 
appropriate conditions permitting expression of the 
recombinant DNA. The expressed selected protein is then 
harvested from the host cell or culture medium by 
suitable conventional means. This claimed process may 
employ a number of known eukaryotic, preferably mammalian 
cells, as host cells for expression of the protein. 

Other aspects and advantages of this invention 
are apparent from the following detailed description of 
the invention. 
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Detailed Description of the Invention 

PACE, an acronym for paired basic amino acid 
cleaving enzyme, is a propeptide-cleaving enzyme 
originally isolated from a human liver cell line. A DNA 
5 sequence encoding PACE (SEQ ID NO: 1) (or furin) was 

published in A.M.W. van den Ouweland et al, Nucl . Acids 
Res. . 18(3) :664 (1990), and is reported below in Table I. 

It should be understood that the enzyme PACE 
(SEQ ID NO: 2) as described herein may be encoded by DNA 

10 sequences that differ in sequence from this published 

sequence (SEQ ID NO: 1) due to natural allelic variations 
or synthetically produced modifications* Provided that 
the biological activities of mediating propeptide 
cleavage and y-carboxylation are retained in whole or 

15 part despite such modifications, this invention 

encompasses the use of all such DNA sequences. The term 
"PACE 11 as used herein thus encompasses the peptide and 
DNA sequences specifically disclosed herein as well as 
analogs thereof retaining PACE biological activity. 

20 Expression of PACE (SEQ ID NO: 2) in host cells 

can improve the efficiency of cleavage of a proprotein 
between the dibasic residues Lys-Arg, Lys-Lys or Arg-Arg 
into its mature form, resulting in high level expression 
of the mature protein. Host cells for this expression 

2 5 include preferably mammalian cells for expression of 
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mammalian proteins. The inventors have now surprisingly 
discovered that co-expression of PACE (SEQ ID NO: 2) with 
proteins requiring y~ carb °xylation for biological 
activity permits the expression of increased yields of 
5 functional, biologically active mature proteins in 

eukaryotic, preferably mammalian, cells. The 
establishment of cell lines which express PACE (SEQ ID 
NO: 2) provides a convenient and efficient mechanism for 
the high level production of more completely processed 
10 and biologically active proteins. 
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Table I 

Published coding sequence of PACE (furin) (SEQ ID NO: 1)/ 
(SEQ ID NO: 2) 

ATG GAG CTC AGG CCC TGG TTC CTA TGG GTC GTA CCA CCA 39 

5 Met Glu Leu Arg Pro Trp Leu Leu Trp Val Val Ala Ala 

5 10 

ACA GGA ACC TTG GTC CTG CTA GCA GCT GAT GCT CAG GGC 78 

Thr Gly Thr Leu Val Leu Leu Ala Ala Asp Ala Gin Gly 
15 20 25 

10 CAG AAG GTC TTC ACC AAC ACG TGG GCT GTG CGC ATC CCT 117 

Gin Lys Val Phe Thr Asn Thr Trp Ala Val Arg lie Pro 
30 35 

GGA GGC CCA GCG GTG GCC AAC AGT GTG GCA CGG AAG CAT 15 6 

Gly Gly Pro Ala Val Ala Asn Ser Val Ala Arg Lys His 
15 40 45 50 

GGG TTC CTC AAC CTG GGC CAG ATC TTC GGG GAC TAT TAC 195 

Gly Phe Leu Asn Leu Gly Gin lie Phe Gly Asp Tyr Tyr 

55 60 65 

CAC TTC TGG CAT CGA GGA GTG ACG AAG CGG TCC CTG TCG 2 34 

2 0 His Phe Trp His Arg Gly Val Thr Lys Arg Ser Leu Ser 

70 75 

CCT CAC CGC CCG CGG CAC AGC CGG CTG CAG AGG GAG CCT 273 

Pro His Arg Pro Arg His Ser Arg Leu Gin Arg Glu Pro 
80 85 90 

2 5 CAA GTA CAG TGG CTG GAA CAG CAG GTG GCA AAG CGA CGG 312 

Gin Val Gin Trp Leu Glu Gin Gin Val Ala Lys Arg Arg 
95 100 

ACT AAA CGG GAC GTG TAC CAG GAG CCC ACA GAC CCC AAG 3 51 

Thr Lys Arg Asp Val Tyr Gin Glu Pro Thr Asp Pro Lys 
30 105 110 115 

TTT CCT CAG CAG TGG TAC CTG TCT GGT GTC ACT CAG CGG 3 90 

Phe Pro Gin Gin Trp Tyr Leu Ser Gly Val Thr Gin Arg 

120 125 130 



GAC CTG AAT GTG AAG GCG GCC TGG GCG CAG GGC TAC ACA 
3 5 Asp Leu Asn Val Lys Ala Ala Trp Ala Gin Gly Tyr Thr 

135 140 



429 
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GGG CAC GGC ATT CTG GTC TCC ATT CTG GAC GAT GGC ATC 4 68 

Gly His Gly lie Val Val Ser lie Leu Asp Asp Gly lie 
145 150 155 

GAG AAG AAC CAC CCC GAC TTG GCA GGC AAT TAT GAT CCT 507 
5 Glu Lys Asn His Pro Asp Leu Ala Gly Asn Tyr Asp Pro 

160 165 

GGG GCC AGT TTT CAT GTC AAT GAC CAG GAC CCT GAC CCC 54 6 
Gly Ala Ser Phe Asp Val Asn Asp Gin Asp Pro Asp Pro 
170 175 180 

10 CAG CCT CGG TAC ACA CAG ATG AAT GAC AAC AGG CAC GGC 58 5 

Gin Pro Arg Tyr Thr Gin Met Asn Asp Asn Arg His Gly 
185 190 195 

ACA CGG TGT GCG GGG GAA GTG GCT GCC GTG GCC AAC AAC 62 4 
Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn 
15 200 205 

CGT GTC TGT GGT GTA GGT GTG GCC TAC AAC GCC CGC ATT 663 
Gly Val Cys Gly Val Gly Val Ala Tyr Asn Ala Arg lie 
210 215 220 

GGA GGG GTC CGC ATG CTG GAT GGC GAG GTG ACA GAT GCA 7 02 
2 0 Gly Gly Val Arg Met Leu Asp Gly Glu Val Thr Asp Ala 

225 230 

GTG GAG GCA CGC TCG CTG GGC CTG AAC CCC AAC CAC ATC 741 
Val Glu Ala Arg Ser Leu Gly Leu Asn Pro Asn His lie 
235 240 245 

2 5 CAC ATC TAC AGT GCC AGC TGG GGC CCC GAG GAT GAC GGC 78 0 

His lie Tyr Ser Ala Ser Trp Gly Pro Glu Asp Asp Gly 
250 255 260 

AAG ACA GTG GAT GGG CCA GCC CGG CTC GCC GAG GAG GCC 819 
Lys Thr Val Asp Gly Pro Ala Arg Leu Ala Glu Glu Ala 
30 265 270 

TTC TTC CGT GGG CTT AGC CAG GGC CGA GGG GGG CTG GGC 858 
Phe Phe Arg Gly Val Ser Gin Gly Arg Gly Gly Leu Gly 
275 280 285 

TCC ATC TTT GTC TGG GCC TCG GGG AAC GGG GGG CGG GAA 897 

3 5 Ser lie Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Glu 

290 295 

CAT GAC AGC TGC AAC TGC GAC GGC TAC ACC AAC AGT ATC 93 6 
His Asp Ser Cys Asn Cys Asp Gly Tyr Thr Asn Ser lie 
300 305 310 
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TAC ACG CTG TCC ATC AGC AGC GCC ACG CAG TTT GGC AAC 97 5 
Tyr Thr Leu Ser lie Ser Ser Ala Thr Gin Phe Gly Asn 
315 320 325 

GTG CCG TGG TAC AGC GAG GCC TGC TCG TCC ACA CTG GCC 1014 
5 Val Pro Trp Tyr Ser Glu Ala Cys Ser Ser Thr Leu Ala 

330 335 

ACG ACC TAC AGC AGT GGC AAC CAG AAT GAG AAG CAG ATC 10 53 
Thr Thr Tyr Ser Ser Gly Asn Gin Asn Glu Lys Gin lie 
340 345 350 

10 GTG ACG ACT GAC TTG CGG CAG AAG TGC ACG GAG TCT CAC 1092 

Val Thr Thr Asp Leu Arg Gin Lys Cys Thr Glu Ser His 
355 360 

ACG GGC ACC TCA GCC TCT GCC CCC TTA GCA GCC GGC ATC 1131 
Thr Gly Thr Ser Ala Ser Ala Pro Leu Ala Ala Gly lie 
15 365 370 375 

ATT GCT CTC ACC CTG GAG GCC AAT AAG AAC CTC ACA TGG 117 0 
lie Ala Leu Thr Leu Glu Ala Asn Lys Asn Leu Thr Trp 
380 385 390 

CGG GAC ATG CAA CAC CTG GTG GTA CAG ACC TCG AAG CCA 12 09 
2 0 Arg Asp Met Gin His Leu Val Val Gin Thr Ser Lys Pro 

395 400 

GCC CAC CTC AAT GCC AAC GAC TGG GCC ACC AAT GGT GTG 12 4 8 
Ala His Leu Asn Ala Asn Asp Trp Ala Thr Asn Gly Val 
405 410 415 

2 5 GGG CGG AAA GTG AGC CAC TCA TAT GGC TAC GGG CTT TTC 12 87 

Gly Arg Lys Val Ser His Ser Tyr Gly Tyr Gly Leu Leu 
420 425 

GAC GCA GGC GCC ATG GTG GCC CTG GCC CAG AAT TGG ACC 13 2 6 
Asp Ala Gly Ala Met Val Ala Leu Ala Gin Asn Trp Thr 
30 430 435 440 

ACA GTC GCC CCC CAG CGG AAG TGC ATC ATC GAC ATC CTC 13 65 
Thr Val Ala Pro Gin Arg Lys Cys lie lie Asp lie Leu 
445 450 455 

ACC GAG CCC AAA GAC ATC GGG AAA CGG CTC GAC CTC CGG 14 04 

3 5 Thr Glu Pro Lys Asp lie Gly Lys Arg Leu Glu Val Arg 

460 465 

AAC ACC GTG ACC GCG TCC CTG GGC GAG CCC AAC CAC ATC 14 4 3 
Lys Thr Val Thr Ala Cys Leu Gly Glu Pro Asn His lie 
470 475 480 
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ACT CGG CTG GAG CAC GCT CAG GCG CGG CTC ACC CTG TCC 14 8 2 
Thr Arg Leu Glu His Ala Gin Ala Arg Leu Thr Leu Ser 
485 490 

TAT AAT CGC CGT GGC GAC CTG GCC ATC CAC CTG GTC AGC 1521 
5 Tyr Asn Arg Arg Gly Asp Leu Ala lie His Leu Val Ser 

495 500 505 

CCC ATG GGC ACC CGC TCC ACC CTG CTG GCA GCC AGG CCA 15 6 0 
Pro Met Gly Thr Arg Ser Thr Leu Leu Ala Ala Arg Pro 
510 515 520 

10 CAT GAC TAC TCC GCA GAT GGG TTT AAT GAC TGG GCC TTC 159 9 

His Asp Tyr Ser Ala Asp Gly Phe Asn Asp Trp Ala Phe 

525 530 

ATG ACA ACT CAT TCC TGG GAT GAC GAT CCC TCT GGG GAG 163 8 
Met Thr Thr His Ser Trp Asp Glu Asp Pro Ser Gly Glu 
15 535 540 545 

TGG GTC CTA GAG ATT GAA AAC ACC AGC GAA GCC AAC AAC 167 7 
Trp Val Leu Glu lie Glu Asn Thr Ser Glu Ala Asn Asn 

550 555 

TAT GGG ACG CTG ACC AAC TCC ACC CTC GTA CTC TAT GGC 1716 
2 0 Tyr Gly Thr Leu Thr Lys Phe Thr Leu Val Leu Tyr Gly 

560 565 570 

ACC GCC CCT GAC GGG CTC CCC GTA CCT CCA GAA AGC AGT 17 5 5 
Thr Ala Pro Glu Gly Leu Pro Val Pro Pro Glu Ser Ser 
575 580 585 

2 5 GGC TGC AAG ACC CTC ACG TCC AGT CAG GCC TGT GTG GTG 17 9 4 

Gly Cys Lys Thr Leu Thr Ser Ser Gin Ala Cys Val Val 

590 595 

TGC GAG GAA GGC TTC TCC CTC CAC CAG AAG AGC TGT GTC 18 3 3 
Cys Glu Glu Gly Phe Ser Leu His Gin Lys Ser Cys Val 
30 600 605 610 

CAG CAC TGC CCT CCA GGC TTC GCC CCC CAA GTC CTC GAT 18 7 2 
Gin Asn Cys Pro Pro Gly Phe Ala Pro Gin Val Leu Asp 
615 620 

ACG CAC TAT AGC ACC GAG AAT GAC GTG GAG ACC ATC CGG 1911 

3 5 Thr Asn Tyr Ser Thr Glu Asn Asp Val Glu Thr lie Arg 

625 630 635 



GCC AGC GTC 
Ala Ser Val 
640 



TGC GCC CCC TGC CAC GCC TCA TGT GCC ACA 
Cys Ala Pro Cys His Ala Ser Cys Ala Thr 

645 650 



1950 
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TGC CAG GGG CCG GCC CTG ACA GAC TGC CTC AGC TGC CCC 1989 
Cys Gin Gly Pro Ala Leu Thr Asp Cys Leu Ser Cys Pro 

655 660 

AGC CAC GCC TCC TTG GAC CCT GTG GAG CAG ACT TGC TCC 2 02 8 
5 Ser His Ala Ser Leu Asp Pro Val Glu Gin Thr Cys Ser 

665 670 675 

CGG CAA AGC CAG AGC AGC CGA GAG TCC CCG CCA CAG CAG 2 067 
Arg Gin Ser Gin Ser Ser Arg Glu Ser Pro Pro Gin Gin 
680 685 

10 CAG CCA CCT CGG CTG CCC CCG GAG GTG GAG GCG GGG CAA 2106 

Gin Pro Pro Arg Leu Pro Pro Glu Val Glu Ala Gly Gin 

690 695 700 

CGG CTG CGG GCA GGG CTG CTG CCC TCA CAC CTG CCT GAG 214 5 

Arg Leu Arg Ala Gly Leu Leu Pro Ser His Leu Pro Glu 

15 705 710 715 

GTG GTG GCC GGC CTC AGC TGC GCC TTC ATC GTG CTG GTC 2184 
Val Val Ala Gly Leu Ser Cys Ala Phe lie Val Leu Val 

720 725 

TTC GTC ACT GTC TTC CTG CTC CTG CAG CTG CGC TCT GGC 2 2 23 
2 0 Phe Val Thr Val Phe Leu Val Leu Gin Leu Arg Ser Gly 

730 735 740 

TTT AGT TTT CGG GGG GTG AAG GTG TAC ACC ATG GAC CGT 2 2 62 
Phe Ser Phe Arg Gly Val Lys Val Tyr Thr Met Asp Arg 
745 750 

2 5 GGC CTC ATC TCC TAC AAG GGG CTG CCC CCT GAA GCC TGG 2 3 01 

Gly Leu lie Ser Tyr Lys Gly Leu Pro Pro Glu Ala Trp 
755 760 765 

CAG GAG GAG TGC CCG TCT GAC TCA GAA GAG GAC GAG GGC 2 34 0 
Gin Glu Glu Cys Pro Ser Asp Ser Glu Glu Asp Glu Gly 
30 770 775 780 

CGG GGC GAG AGG ACC GCC TTT ATC AAA GAC CAG AGC GCC 2 3 79 
Arg Gly Glu Arg Thr Ala Phe lie Lys Asp Gin Ser Ala 

785 790 



CTC TGA 
3 5 Leu End 

795 



2385 
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It is presently and theoretically contemplated 
that the specific mechanism underlying enhanced 
expression of function Y~ car koxylated proteins resides in 
the expression of DNA encoding PACE in mammalian cells 
which increases the efficiency of Y~ carbox y lat i° n > a 
post-translational modification required for biological 
activity of certain mature proteins. The method is 
especially useful in the processing of vitamin In- 
dependent blood coagulation proteins. More specifically 
the method is useful in processing and y-carboxylating 
other proteins including Protein C, Protein S, 
Prothrombin Factor IX, Factor VII, Factor X and bone y- 
carboxyglutamate protein. For example, co-expression 
with PACE (SEQ ID NO: 2) with such a propeptide permits 
high level recombinant expression of biologically active 
mature proteins. 

In addition, high levels of recombinant 
expression of functional proteins can also be achieved by 
use of the present method by expressing PACE (SEQ ID NO: 
2) with more completely processed proteins expressed from 
other genes. For example, coexpression of PACE (SEQ ID 
NO: 2) with non-Vitamin K dependent propeptides which 
require cleavage but not y-carboxylation for biological 
activity may produce high yields of functional mature 
proteins . 



# 
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One such protein which may be expressed in high 
functional yields by the present method is bone 
morphogenic protein (BMP), particularly BMP-2 [see, e.g., 
E. Wang et al, Proc. Natl. Acad. Sci. USA , 87 : 2220-2224 
(1990), which is incorporated by reference herein for 
information about that protein] . Other such proteins 
which may be produced in high functional yields by the 
present invention include tumor growth factor (3 (TGF-/3) 
and platelet-derived growth factor (PDGF) , among others. 

Further, the present invention also encompasses 
the use of recombinant-derived PACE (SEQ ID NO: 2) for in 
vitro processing of nerve growth factor and monobasic 
propiomelanocortin. PACE (SEQ ID NO: 2) may also be 
useful in the processing of proteins, such as insulin, 
and for the maturation of viruses, such as HIV and 
Hepatitis C, which also require precursor processing at 
paired basic amino acid residues. 

Transfection of a DNA sequence encoding PACE 
(SEQ ID NO: 1) and a DNA sequence for a selected 
propeptide precursor into a mammalian cell can be 
effected via one or more recombinant vectors carrying 
PACE (SEQ ID NO: 1) , the mammalian propeptide, or both, 
using materials and methods conventional in heterologous 
gene expression in mammalian cells. 
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Host cells transformed with the one or more 
vectors carrying the PACE DNA (SEQ ID NO: 1) and the 
selected precursor DNA are selected, e.g. by conventional 
means, and may then be cultured under suitable 
conditions if desired, with amplification of one or both 
introduced genes. The method of this present invention 
therefore comprises culturing a suitable cell or cell 
line, which has been transformed with a DNA sequence 
coding for PACE (SEQ ID NO: 1) and a DNA sequence coding 
for the selected precursor, each coding sequence under 
the control of a transcriptional regulatory sequence. 
The expressed mature protein is then recovered, isolated 
and purified from the culture medium (or from the cell, 
if expressed intracellularly ) by appropriate means known 
to one of skill in the art. 

Suitable cells or cell lines for this method 
are mammalian cells, such as Chinese hamster ovary cells 
(CHO) , the monkey COS-1 cell line or murine 3T3 cells 
derived from Swiss, Balb-c or NIH mice. The selection of 
suitable mammalian host cells and methods for 
transformation, culture, amplification, screening and 
product production and purification are known in the art. 
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See, e.g., Gething and Sambrook, Nature, 293 : 620-625 
(1981), or alternatively, Kaufman et al, Mol . Cell. 
Biol. . 5(7) : 1750-1759 (1985) or Howley et al, U. S. 
Patent 4,419,446. Another suitable mammalian cell line 
is the CV-1 cell line. 

Further exemplary mammalian host cells include 
particularly primate cell lines and rodent cell lines, 
including transformed cell lines. Normal diploid cells, 
cell strains derived from in vitro culture of primary 
tissue, as well as primary explants, are also suitable. 
Candidate cells may be genotypically deficient in the 
selection gene, or may contain a dominantly acting 
selection gene. Other suitable mammalian cell lines 
include but are not limited to, HeLa, human adenovirus 
transformed 293 cells, mouse L-929 cells, BHK or HaK 
hamster cell lines. 

The present invention also provides recombinant 
DNA molecules, or vectors, for use in the method of 
expression of active mature proteins, such as those 
described above. A single vector can carry the PACE DNA 
(SEQ ID NO: 1) and another vector can carry the selected 
precursor DNA, each under the control of a selected 
expression control sequence. Alternatively, both the 
PACE (SEQ ID NO: 1) and precursor DNA sequences may be 
carried on a single recombinant vector molecule in which 



# 
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case they may be operably linked to respective expression 
control sequences or may share a common expression 
control sequence. In general, the vectors employed will 
contain selected regulatory sequences in operative 
5 association with the DNA coding sequences of PACE (SEQ ID 

NO: 1) and selected precursor and capable of directing 
the replication and expression thereof in selected host 
cells. 

The vector used in the examples below is pMT3 , 
10 a derivative of the previously described vector pMT2 [R. 

Kaufman, Mol. Cell. Biol. , 9:946-958 (1989)]. The 
mammalian cell expression vectors described herein may be 
synthesized by techniques well known to those skilled in 
this art. The components of the vectors, e.g. repl icons, 
15 selection genes, enhancers, promoters, and the like, may 

be obtained from natural sources or synthesized by known 
procedures. [See, Kaufman et al, J. Mol . Biol . . 159:511- 
521 (1982) ; and Kaufman, Proc. Natl. Acad. Sci., USA , 
8 2 : 689-693 (1985) ] . 
2 0 Alternatively, the vector DNA may include all 

or part of the bovine papilloma virus genome [Lusky et 
al, Cell, 3.6:391-401 (1984)] and be carried in cell lines 
such as C127 mouse cells as a stable episomal element. 
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The transf orination of these vectors into appropriate host 
cells can result in expression of the selected mature 
proteins. Other appropriate expression vectors of which 
numerous types are known in the art for mammalian 
5 expression can also be used for this purpose. 

The following examples illustratively describe 
the construction of plasmids for the expression and 
production of PACE (SEQ ID NO: 2) in mammalian cells, and 
the co-expression of PACE (SEQ ID NO: 2) and the blood 
10 coagulation factor, Factor IX, in mammalian cells. These 

examples are for illustration and do not limit the scope 
of the present invention. 



Example 1 - Plasmid Construction and Expression of PACE 
cDNA in COS-1 Cells 

15 A 2.4 7 kbp PACE cDNA fragment [Chiron 

Corporation, California] is employed, which includes the 
published 794-codon PACE coding sequence and 74 bases of 
3 1 -untranslated sequence before a Sail linker [A.M.W. van 
den Ouweland et al, cited above] (SEQ ID NO: 1)/(SEQ ID 

2 0 NO: 2). At the 5' -end, the sequence immediately 

preceding the ATG was modified to conform to the 
consensus translation start site using a EcoRI 
oligonucleotide adapter. 
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The 2.47 kbp (EcoRI-Sall) PACE cDNA fragment 
(SEQ ID NO: 1) was inserted into the SV40-based 
expression vector pMT3 to generate the plasmid pMT3-PACE. 
pMT3 has been deposited with the American Type Culture 
Collection (ATCC) , Rockville, MD (USA) under Accession 
Number ATCC 40348. The pMT3 vector is a derivative of 
the previously described vector pMT2 [Kaufman, cited 
above] starting with pMT2-vWF, which is deposited at the 
American Type Culture Collection, Rockville, MD (USA) , 
Accession Number ATCC #67122; see PCT application 
PCT/US87/00033 ] . To form pMT3 , the DHFR coding region on 
the 3' side of the cloning site in pMT2 is removed. 

One skilled in the art can also construct other 
mammalian expression vectors comparable to the pMT3/PACE 
vector by, e.g. inserting the DNA sequence of PACE (SEQ 
ID NO: 1) from pMT3 into another vector, such as pJL3 , 
pJL4 [Gough et al . , EMBO J. , 4: 645-653 (1985)], employing 
well-known recombinant genetic engineering techniques. 

pMT3-PACE was purified and introduced for 
transient expression into SV4 0-transf ormed monkey kidney 
cells (COS-1) using a calcium phosphate transfection 
protocol as described in Chen, C. A., and Okayama, H. , 
BioTechniques , 6:632-638 (1988). Cells were transfected 
with 40 fig of plasmid or, in the case of co- 
transf ections, an equimolar ratio of plasmids totalling 
60 /xg per 10 cm dish in 10 ml of medium. 
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Mammalian host cells other than COS cells may 
also be employed in PACE (SEQ ID NO: 2) expression. For 
example, preferably for stable integration of the vector 
DNA, and for subsequent amplification of the integrated 
5 vector DNA, both by conventional methods, CHO may be 

employed as a mammalian host cell of choice. 

To monitor PACE (SEQ ID NO: 2) synthesis, COS-1 
cell products were radiolabeled 48-60 hours following 
transfection using 35 S-Met and 35 S-Cys in medium lacking 

10 Cys and Met. Cells were lysed in NP-40 lysis buffer 

after a 3 0 minute pulse period or were chased by removing 
the labeling medium and replacing it with complete medium 
for additional incubation. Cell extracts and conditioned 
medium were treated with protease inhibitors and 

15 immunoprecipitated as described in Wise et al, Cell, 

52 : 229-236 (1988) . 

Immunoprecipitates were performed with rabbit 
anti-PACE antiserum produced against a PACE- E. coli 
fusion protein. Rabbit anti-PACE antiserum was generated 

2 0 against the catalytic domain of PACE by expression of 

amino acids 146 to 372 of PACE (SEQ ID NO: 2) as a human 
superoxide dismutase (SOD) fusion protein in E. coli . 
The DNA fragment for expression was generated by 
polymerase chain reaction (PCR) and cloned into the 

2 5 superoxide dismutase (SOD) fusion vector pTAC7 [Steimer 

et al, J. Virol. . 58:9 (1986)]. 



• 
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The induced fusion protein was purified by 
preparative polyacrylamide gel electrophoresis, eluted 
and used to immunize rabbits in complete Freunds 
adjuvant. The immunoprecipitated samples were then 
5 analyzed by SDS-polyacrylamide gel electrophoresis (SDS- 

PAGE) . The gels were prepared for fluorography in 
EnHance (Dupont) . 

In control lysates from COS-1 cells that did 
not receive pMT3-PACE, no immunoreactive proteins were 

10 detected. However, in extracts from pMT3-PACE 

transfected cells, immunoreactive species were detected 
that migrated primarily as a doublet of approximately 90 
kD. Treatment of these PACE immunoprecipitates with the 
endoglycosidase enzyme, N-Glycanase, shifted the 

15 electrophoretic mobility of the PACE (SEQ ID NO: 2) 

consistent with the presence of asparagine-1 inked 
oligosaccharides . 

Secreted products were analyzed from 
conditioned medium following a 12 hour chase period in 

2 0 medium containing an excess of unlabeled amino acids. 

Immunoprecipitations of the conditioned medium from pMT3- 
PACE transfected cells detected an immunoreactive protein 
migrating at 75 kD. The relative quantity of the 75kD 
PACE protein observed in the conditioned medium was 5 to 

25 10 fold less than that remaining inside the cell at the 



12 hour chase period. This secreted species may 
represent a truncated molecule missing the transmembrane 
domain, possibly the result of auto-proteolysis at the 
paired arginine residues, 497-498, due to the large 
overproduction of PACE (SEQ ID NO: 2) in the transfected 
COS-1 cells. 

More extensive pulse-chase experiments 
demonstrated that the PACE translation product does not 
accumulate to high levels inside the cell compared to 
another integral membrane glycoprotein (influenza 
hemagglutinin) when synthesized at similar levels. 

Example 2 - Co-Expression of PACE and Factor IX 

A CHO cell line producing recombinant Factor 
IX (IC4) [the IC4 cell line is described in Kaufman et 
al, J. Biol . Chem. . 261 : 9622-9628 (1986)] and Factor IX 
sequences were transfected with the PACE cDNA (SEQ ID NO 
1) described above in Example 1 operatively linked to 
another amplifiable marker, adenosine deaminase. The 
vector MT3SV2Ada [R.J. Kaufman et al, Meth. Enzvm. , 
185 : 537-566 (1990)] was chosen for PACE expression 
because it contains a selectable ADA transcription unit 
but no DHFR sequences and the PACE fragment could easily 
be inserted after digestion of the vector with EcoRl and 



# 
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A vector fragment was isolated from low melt 
agarose, ligated in a ratio of 5:1 (fragment to vector), 
diluted in 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, and used to 
transform DH5 bacteria [Dr. Douglas Hanahan, Cold Spring 
5 Harbor, New York] . A nick-translated, 32 P labelled PACE 

fragment was prepared and used for filter hybridization 
to screen transformed colonies. 

Positively hybridizing colonies were isolated 
and DNA prepared for digestion with EcoRl and Sail for 

10 confirmation of PACE (SEQ ID NO: 1) insertion and with 

Bgl II for correct orientation of the fragment with 
respect to adenovirus major late promoter in the vector. 

DNA from one colony was isolated for 
electroporation into the Factor IX producing cells, IC4 . 

15 Pools of colonies have been selected for amplification by 

growth in 1.0 /xM 2 ' -deoxycof ormycin (DCF) . The presence 
of PACE (SEQ ID NO: 2) in these amplified lines was 
confirmed by 35 S-methionine labelling and 
immunoprecipitat ion . 

20 Biological activity of the Factor IX protein in 

the PACE/IX pools was analyzed by clotting assay, 
performed as described in Kaufman et al , J . Biol . Chem . , 
261 : 9622-9628 (1986) . Cells were plated in p60 tissue 
culture dishes. The next day medium was reduced (1.5 ml) 

25 and changed to a "defined" + 1 jug/ml Vitamin K3 . 
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The PACE/Factor IX pools were found to secrete 
between 2,0 and 3.1 fold more Factor IX biological 
activity than the original IC4 cell line. The results of 
a radioimmunoassay indicated increased levels of y- 
5 carboxylated protein. These results are illustrated in 

Table I below. 
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From the first electroporation of MT3 PACE Ada 
into IC4 cells, cells were selected in a medium with 10% 
dialyzed fetal calf serum, penicillin, streptomycin, 
glutamine, 200 /xM Methotrexate and Adenosine, alanosine, 
uridine and 0 . 1/iM DCF. Approximately 25 colonies were 
observed in plates that did not receive DNA . 

A second electroporation performed was selected 
in the same manner and approximately 100 colonies were 
pooled into each of the 5 pools. Again, no colonies were 
observed on plates that did not receive DNA. 

Expression of PACE (SEQ ID NO: 2) was detected 
in each pool by 3 0 minute pulse with 35 S Methionine 
followed by 2 hour chase and immunoprecipitation of cell 
extracts with a PACE antibody [Chiron Corporation, 
California] . In cells which express higher levels of 
PACE (SEQ ID NO: 2) as a result of selection for further 
DCF resistance, secretion up to 10-fold greater levels of 
y-carboxylated Factor IX was observed compared to the 
original IC4 cell line. 

The coexpression of PACE (SEQ ID NO: 2) did not 
produce any detectable change in the size of the Factor 
IX protein as monitored by immunoprecipitation with a FIX 
antibody [Hybridtech] and SDS gel electrophoresis. 
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Numerous modifications and variations of the 
present invention are included in the above-identified 
specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to 
5 the compositions and processes of the present invention 

are believed to be encompassed in the scope of the claims 
appended hereto. 



